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Self-similarity in the network traffic has been studied from several aspects: both at the user 
side and at the network side there are many sources of the long range dependence. Recently some 
dynamical origins are also identified: the TCP adaptive congestion avoidance algorithm itself can 
produce chaotic and long range dependent throughput behavior, if the loss rate is very high. In 
this paper we show that there is a close connection between the static and dynamic origins of self- 
similarity: parallel TCPs can generate the self-similarity themselves, they can introduce heavily 
fluctuations into the background traffic and produce high effective loss rate causing a long range 
dependent TCP flow, however, the dropped packet ratio is low. 



INTRODUCTION 

In a large number of internet traffic measurements 
many authors detected self-similarity 0, 0, S ^^If- 
similarity is usually attributed with heavy-tailed dis- 
tributions of objects at the traffic sources, e.g. sizes 
of transfered files 0, 0| or time delays in user 
interactions^. 

Recently the dynamical origins of self-similarity also 
attract increasing attention. Chaotic maps are known 
to generate fractal properties!^, and it has been shown, 
that the exponential backoff algorithm of the TCP proto- 
col can produce long range dependent traffic under some 
special circumstances. Veres[l3 emphasized the chaotic 
nature of the transport, and Feketej^ gave an analyt- 
ical explanation that this chaoticity is in close relation 
with the loss rate at highly utilized buffers due to packet 
drops. Guojoll gave a Markovian analysis of the backoff 
mechanism, and showed, that the self-similarity can be 
observed only at very high loss rates. 

In this paper we study a lossless network, where the 
traffic sources are not heavy-tailed distributed. Even 
though we observe self-similar traffic, which is generated 
by the communicating TCPs themselves. We show, that 
the random switching between the destinations of the 
flows together with the complex dynamics of interacting 
TCPs can lead to long range dependency. The interac- 
tion between TCPs at the buffers leads to backoff phases 
in the individual TCPs causing fractal traffic in an en- 
vironment where the loss rate is much below the lower 
bound of self-similarity. 

The outline of the paper is the following: First we 
introduce the concept of real and effective loss. Then 
we present a simple lossless model network, where self- 
similar traffic is observed, however the necessary condi- 
tions discusssed in the literature cited above, are not sat- 
isfied. Next we show that similar scenario can be found 
in real networks as well. Finally we conclude our results. 



REAL AND EFFECTIVE LOSS 

In the internet traffic many individuals use a common, 
finite resource to transmit information. If the resources 
are exhausted (e.g. routers are congested), data through- 
put is not possible. Therefore data transmitters should 
avoid congestion on shared information routes. Most 
of today's computer programs use similar algorithm to 
avoid congestion: they apply basicly the same TCP pro- 
tocol with slight differences. The common concept in ev- 
ery TCP is, that the data sending rate must be adapted 
to the actually available resources [13 Ol- Every TCP 
starts with a blind measuring phase (slow start), which 
exponentially reaches the maximum throughput rate. If 
the route, where TCP sends its data is stationary utilized, 
the algorithm works in a high throughput - slow adaption 
phase (congestion avoidance). The sending rate is var- 
ied around a high value by slowly increasing and rapidly 
decreasing it. Since every sent packet and its received 
acknowledgement continuously measure the transmission 
possibilities, this phase is very stable and can adapt to 
slowly varying situations 

If the route gets highly loaded, the TCP tries to clear 
the congestion by decreasing the sending rate. If the 
congestion is so high, that the TCP cannot guess the 
proper sending rate (acknowledgements do not arrive in 
the timeout period), the algorithm enters a very slow 
sending phase (exponential backoff). In this phase due 
to the lack of information an exponentially slowing al- 
gorithm is applied to try to find a new possible sending 
rate: the last package is resent after exponential increas- 
ing time intervals until an acknowledgement received or 
a maximum time interval is reached. 

In this paper we concentrate on the backoff phase of 
the TCP. We will show, that due to its blind nature, in 
this phase the TCP can feel higher loss rates as it really 
is. 

By the blindness of the TCP we mean the consequence 
of Karn's algor ithm|l3, which governs the backoff phase. 
Under normal transmission conditions TCP operates in 
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slow start or in congestion avoidance mode. In these 
modes the TCP estimates the optimal sending rate from 
the arrival time of the acknowledgements (ACK) by cal- 
culating the average round trip time (SRTT) and its av- 
erage deviation from the mean (MDEV). After each re- 
ceived ACK the TCP estimates the retransmission time- 
out (RTO). If this timeout is exceeded between sending 
a packet and receiving an acknowledgement for it, the 
TCP retransmits the packet assumed to be lost (by real 
loss or timeout). In this situation TCP applies the so 
called Karn's algorithm. 

The Karn's algorithm specifies that the acknowledg- 
ments for retransmitted data packets cannot be used to 
approximate the sending rate. Since for a received ACK 
packet one cannot decide if it is the ACK of the origi- 
nal or of the retransmitted packet, the round trip time 
(RTT) and so the sending rate cannot be estimated. The 
RTT can be calculated only for those packets, which are 
not retransmitted. So the TCP retransmits the packet 
and doubles the RTO calculated from the previous flow- 
informations (backoff phase). If the retransmitted packet 
timeouts again, the RTO is doubled and the packet is 
retransmitted again. The RTO is increased up to a max- 
imal value defined in the protocol. The TCP leaves the 
backoff phase only if the RTT can be estimated without 
ambiguity: the TCP must receive the acknowledgements 
of two consecutive sent packets. We will show a situation 
where this method reports reasonably higher loss rate for 
the TCP as it really is. 

We distinguish the loss into real or virtual. Real loss is 
referred to dropped packets which either are not arrived 
to the destination or the acknowledgment for it do not 
arrive to the sending TCP. We call a loss to be virtual 
if the acknowledgment arrives after the retransmission 
timeout (RTO) period, so the packet is retransmitted 
due to a spurious timeout. The effictive loss is assembled 
from the real and virtual losses. 

This distinction is important, since real loss emerges 
at highly congested buffers or at low quality lines (e.g. 
radio connections). These situations can be solved by 
improving the hardware conditions. In contrast, high 
virtual loss can evolve also under very good hardware 
conditions from heavily fluctuating background trafhc. 
On a route with several routers, where the packets can 
stay in a long queue, round trip times can change in a 
wide range depending on the queuing time. The queu- 
ing time depends on the saturation of the buffers on the 
route. If the background traffic fills the buffers at a vary- 
ing rate, the queueing time, and so the round trip time 
varies also. Bursty background traffic can fill the buffers 
rapidly to a high value, and after that can leave it to be 
cleared out. 

If the round trip time increases to such a high value 
due to a rapid filling up, that it becomes larger than the 
retransmission timeout value, a virtual loss occurs. After 
a burst which caused the virtual loss the clearing out of 



the buffer will lead to a shorter round trip time, which 
decreases the RTO value also. So for the next burst event 
the RTO is not large enough that the TCP can receive 
the ACK packet. So another virtual loss occurs without 
really loosing the sent packets. 

We will show in a model network and in a real measure- 
ment, that long range dependent traffic can emerge from 
the virtual losses due to the bursty background, however, 
real packet loss rate is so low, that one would expect a 
scalable simple traffic rate. 

CASE STUDY: SIMULATION 

In this section we present a simple model network, 
which shows self-similar traffic. Our model differs in sev- 
eral aspects from previous studies in the literature ex- 
cluding the known reasons of self-similarity. 

In our model three hosts transfer fixed sized files to 
each other through a router. All hosts transfer files with 
the same size. The topology of the model is shown in 
Fig. From each numbered sites of the network a TCP 
fiow is initiated to one of the other numbered sites. Each 
TCP flow passes through the router R using full duplex 
connections, so the flow of the acknowledgements do not 
interfere with the corresponding TCP data fiow. However 
data from other TCPs must share the same buffers and 
lines with acknowledgements. 




FIG. 1: The topology of the network model. The numbered 
circles represent hosts with TCP agents, and R denotes the 
router. 

We have chosen the network to be practically lossless: 
the buffer length in the router was set so large, that it 
is very improbable that TCP flows fill them. All the six 
buffers for the three full duplex lines are large enough 
to store all the files transfered between the hosts at a 
moment. There is no external packet loss on the lines as 
well. 

We will study the traffic properties on a line connecting 
a chosen host with the router. So the packet flows we 
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want to analyze are initiated from a fixed chosen host 
and they are built up from several successive TCP flows. 

In this topology the traffic is not always self-similar. 
The throughput of packets on a line can be regular if the 
destination of the individual TCP flows is chosen on a 
regular way. An example is shown in Fig. 12^^, where the 
TCP flows has been generated with the following sim- 
ple rule: from the host numbered hy i (i — 1 ... 3) the 
TCP sends packets to host {i mod 3) + 1. After a file 
has been transmitted, the host starts a new TCP flow 
immediately, there is no external random delay between 
the flows as it would be if we took the user behavior 
into account [3. . Under such regular sending rules the 
TCPs can utilize the available bandwidth and the traffic 
has a scalable periodic pattern. In Fig. |2t we show the 
congestion window for a host. 
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FIG. 2: The congestion window for one of the hosts, a) regular 
case b) random case 

We have implemented this simple network in the ns-2 
simulator (version number: 2. Ib7a)[3|- The link pa- 
rameters are: link rate 1Mbps, delay 1ms. File size of 
a TCP flow was 1000 packet. The receiver window was 
much larger than the file sizes. We have used the Reno 
version of TCP. 

If we introduce stochasticity in the sending rules we 
arrive at a non-scalable, long range dependent traffic. 
We applied the following rules to generate self-similar 
packet transport. All hosts send fixed size files to each 
other. Each host uses the same file size. If a host starts a 
new TCP, it randomly chooses to which host to send the 
file. After a transmission is completed, the host chooses 
the next destination immediately. The next destination 
is chosen randomly again without silent period between 
consecutive TCP flows. In Fig. |2d we show, that the 
stochasticity inhibits the TCPs to synchronize and the 
packet transport becomes irregular. The size of the trans- 
fered flies was chosen not too large to hinder the TCPs 
to adapt to each other. 

We investigate now the irregular packet traffic if 
it shows self-similarity or not. Self-similarity can be 
tested by investigating the second order statistics of the 



traffic'l3l. Consider a weakly stationary process X, 
with constant mean and autocorrelation function c{k). 

Let X"'{k) = ^ E,t^(fc-i)m+i X{i) denote the m ag- 
gregated series of X. The process X is self-similar if 
X « m^~^X™, and X is second order self-similar if 
m^^^ X"^ has the same variance and autocorrelation as 
X. The sign « expresses the fact that the equality can 
be satisfied only in a stochastic sense, exact equation can 
only be used for abstract mathematical objects. 

We have performed self-similarity test by using the 
variance time method. In Fig. |2| we plot the variance 
of the aggregated time series of the packets which scales 
as 

VariSX) = (X'"(fc) - (X'")fc)2 - m^". 

The fitted line in the figure indicates Hurst exponent 
H = 0.89 showing that the time series is self-similar since 
H > 0.5. We emphasize again, that the time series under 
consideration is built up from several consecutive TCP 
fiows. 

20 I ^ ^ ^ ^ , ^ 1 

15 - 

„ 10 - 
> 



- , 

-5 ' • • • • ' • 

-8 -6 -4 -2 2 4 6 

logt 

FIG. 3: Variance time plot of the aggregated traffic on a host- 
router hnk. The fitted line indicates H = 0.89. 

If a traffic is self-similar it shows properties which dif- 
fers from ones of memory-less (Markovian) processes: the 
dynamics of the system shows a strong history depen- 
dence. This long range dependence in the time evolution 
manifests itself typically in heavy-tailed distributions. A 
distribution is heavy-tailed if asymptotic decay of the dis- 
tribution is slow: a power-law with exponent less than 
two. Due to the always existing upper bounds of the 
measured data it is enough if the decay holds in the last 
decades below the maximum value: 

^(5* > s) - s"(i+"\as s G [10^" Sraax,Smax], and n > 2.5 . 

Such distributions are called heavy-tailed, since occur- 
rence of values much larger than the mean value of the 
observable s is not negligible in contrast to commonly 
used distributions as Gaussian or exponential. However 
in measured time series it can happen, that from the 
tail we cannot detect so many events as it is needed 
to plot a smooth distribution function. In these cases 
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it is favorably to work with the cumulative distribution 
Pc{s) — J P{s')ds' , which has an asymptotic behavior 
as Pc(s) ~ 1 — s^". Therefore one should use the inverse 
cumulative function 1 — Pc{s) to fit the parameter a on 
the logarithmic plot. 

Now we want to investigate if the long range depen- 
dency shows up in the traffic. We consider only the case 
when a destinations of the TCPs were chosen randomly. 
In Fig. 1^ we plot the inverse cumulative distribution of 
the packet inter arrival time on a down link. The distri- 
bution shows a slow decay with a = 1.18 which indicates 
that the fluctuating traffic has long silent periods. A 
power law decaying fluctuation can be characterized by 
the Hurst exponent if the traffic is viewed as an ON-OFF 
process. The silent periods are the OFF intervals of the 
process. The Hurst parameter is known for this type of 
process 0: H — which gives similar result as calcu- 
lated from the variance time plot in Fig. |3| 




FIG. 4: The distribution of the packet inter arrival times. 
The fitted line indicates H — 0.91. 

In the following we look for the origin of the long range 
dependence found above. In our model the topology and 
the TCP flow generating rules are chosen such a way, 
that the link/source conditions for self-similarity 0, 
are excluded. In network-side models the adaptive na- 
ture of TCP, namely the TCP backoff state mechanism 
is pointed out as a main origin of such behavior 0, 0. 
We investigate now if there is a close relation between 
the self-similarity of the traffic, and backing off of the 
TCP. 

In the backoff phase TCP retransmits the lost packet 
and doubles the RTO. TCP keeps track of the doubling 
times by a backoff variable b. In the non-backoff phases 
6 = 0, and in backoff b shows how many times the RTO 
counter has been doubled. Due to Karn's algorithm the 
RTO is doubled until two consecutive packet receives its 
acknowledgement . 

First we recall shortly, that a TCP flow in the backoff 
phase produces heavy-tailed statistics in the packet in- 
ter arrival timeQ. A TCP in a given backoff state waits 
for a period t = 2''Tjitt between two packet sending 
attempts. The 5-th backoff state occurs only after b suc- 



cessive packet losses. Let's denote the packet retransmis- 
sion probability (effective loss) with p. The probability 
of b consecutive packet retransmission is p^. Hence the 
probability of a silent period t due to backoffs, decays as 
piog2{t/TnTT) _ fi-a^ where a = log2il/2p). 

Next we repeat the main idea of a Markovian chain 
model for backoff states'^, and show, that the statis- 
tics of backoff states delivers the average effective loss 
probability p. 

Let denote the probability Pb that the TCP is in a 
b deep backoff. In a simplified Markovian framework 
one can estimate the Pt by the transition probabilities 
between backoff states as follows (for a detailed ma- 
trix representation see 0. The RTO value is doubled 
if one of two successive packets do not receive ACK 
and is retransmitted. If the retransmission probabil- 
ity is p the transition probability to a deeper backoff is 
l~{l—p)^ = 2p—p^. This yields a backoff probability de- 
cay to be Pb ~ {2p—p^)^ and one can read off the average 
loss probability from the gradient of the semilogarithmic 
plot of Pb versus b. We emphasize here, that the loss 
probability measured by the probability of backoff states 
is the effective loss felt by the TCP. This probability can 
be much larger as the real loss. This is the case in our 
model, since the real loss is below 0.1%, however, the ef- 
fective loss is about 21%. A typical backoff distribution 
for our stochastic model is shown in Fig. 



FIG. 5: Logarithmic plot of the probability distribution of 
backoff states (Pt) as the function of the backoff variable 6 
The fitted line indicates p = 21% loss probability which gives 
a Hurst parameter H = 0.89. 

This gives us the possibility to demonstrate the con- 
nection between long range dependency and the backoff 
distribution. One compares the probability p calculated 
from the backoff statistics and the inter packet arrival 
time decay factor a calculated from the packet traffic 
time series. The two value agrees as a « log2{l/2p), 
hence the long range dependency is caused mainly by 
the backoff mode of the TCP (and not by other external 
reasons as e.g. long range distributed user interaction). 

We have demonstrated the connection between the 
long range dependency, self-similarity and backoff mech- 
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anism. Finally we search for the origins of backing off 
the TCP. 

Our model by construction excludes the known ori- 
gins of self-similarity: the TCP flows follow each other 
immediately and transfer data at a given rate without 
external silent periods as e.g. would be the case with 
user-generated interactive flow. The transfered file sizes 
are constant. The buffer lengths are much above the crit- 
ical chaotic threshold value^6|. The only stochasticity is 
in the switching between the destinations of the TCP 
flows. This irregularity introduces some unpredictability 
in the data flow. If this unpredictability is high, the TCP 
estimation for the available sending rate is false. The 
consequences of unpredictability has been studied from 
many aspects 0, 0|, however all the previous studies re- 
quire a case when the high real loss probability (due to 
small buffers or external loss) hinders the TCP to make 
sufflcient predictions. 

Here we presented a model, where the stochastic choos- 
ing of destination itself pushes TCP into the backoff 
phase and generates self-similar traffic. How can this 
happen? 

TCP operates in backoff, if the ACK packet arrive af- 
ter the limit set by RTO. The RTO value is calculated 
from the traffic continuously, using an average over some 
time-window. If the traffic is bursty, with silent periods 
comparable with size of the averaging window, the TCP 
cannot adapt to the rapid changes in the traffic. In our 
model we detect heavy bursts in the queue lengths in the 
router. Since TCPs changes the destination randomly, it 
can happen, that after a silent period a buffer will be fed 
by one or two TCP. If these TCPs are in slow start, the 
feeding of a buffer can be exponential fast. The queue 
lengths can hence grow very rapidly. If a queue gets 
longer, packets arriving in this queue must wait longer. 
A rapid change in the queue length can cause a so rapid 
change in the round trip time of a packet, that the ACK 
for this packet arrives after the RTO expires. So large 
fluctuations in the queue length (background traffic) can 
cause a series of virtual losses and backing off the TCP. 

In Fig. El we show a typical queue length - time plot, 
where the large fluctuations cause backoff phase in a 
TCP. There is a clear relation between the increasing 
queue length and the evolution of backoff states. 

Since in our model only the heavily fluctuating back- 
ground traffic can back off a TCP, we can conclude to 
identify the fluctuating background as a source of self- 
similarity. This self-similarity is a self-generated one, 
originating from the improper synchronization of hosts, 
which continuously send data to each other by using 
many successive TCP flows. 
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FIG. 6; Queue length in one of the buffers in the router R 
and the backoff variable of the TCPs in a host. The time 
intervals where the queue length changes drastically overlap 
clearly with the backoff phases of the TCP. The inset shows 
a magnification of part of the data. 

CASE STUDY: MEASUREMENT 

In this section we present active measurement results 
which show similar results in a real network environ- 
ment as found in the previous section in a small model 
scenario. The time evolution of a long TCP flow on a 
transcontinental connection was followed on the IP level 
by tcpdump^3| and on the kernel level by a slightly mod- 
iflcd linux kernel from the 2.2.x series. The modifled ker- 
nel gave us the possibility to follow directly the internal 
TCP variables in situ for a real network environment. 

On the transcontinental line chosen for the measure- 
ment typically many TCP connections share the same 
route giving a highly fluctuating background traffic. Ad- 
ditionally on the long line with many routers it is pos- 
sible that the packets of our TCP flow stacks in flUed 
queues. So the round trip time can fluctuate in a very 
wide range resulting many backoff states. Figure [T] shows 
a very congested time interval, where many backoff states 
were observed. Here we mention, that in contrast to the 
TCP implementations of ns-2, the backoff variable b of 
the linux kernel can have larger values than 6. 

As described in the previous section the self-similarity 
is characterized by the Hurst parameter, if the stochastic 
process under consideration is weakly stationary. To sat- 
isfy this condition we restrict our analysis only for some 
parts (time intervals) of the whole measurement. 

In the time range under study the highly congested 
traffic showed self-similar nature. The variance time plot 
for the aggregated time series of packet arrivals is plotted 
in FigurelHl from which we can read off the Hurst param- 
eter 0.69. In Fig. El we show the statistical distribution 
of packet inter arrival times, which show an a = 1.505 
decay giving a similar value for the Hurst parameter as 
calculated from the variance time plot. 

Since we do not have total control over the whole in- 
ternet, we cannot prove rigorously that the observed self- 
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FIG. 7: Backoff states of a TCP, wliidi sends data tlirougli a 
congested transcontinental line. 
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FIG. 8: Variance time plot of measured data in the longest 
stationary time interval. The fitted line indicates H = 0.69. 



similarity is the consequence exclusively of the fluctua- 
tions in the background traffic as it is in the simulation 
scenario presented in the previous section. However it is 
possible to show, that — as in the simulation — there is 
a close relation between the inter packet time statistics 
and the backoff statistics under such conditions where 
the real packet loss is low, indicating self-generated self- 
similarity. 

Here we investigate first, what was the loss rate at 
the line. In end-to-end measurements packet loss can 
be easily detected by analyzing tcpdump data. But to 
gain this direct information about the traffic, one needs 
special rights on the origin of the TCP flow and on the 
destination as well. This ideal condition is given usually 
only for a very restricted number of routes. In most cases 
one can monitor the network traffic only on one side as 
it was the case in our measurement. We applied the 
algorithm of Benko et.al.Q with some improvement to 
detect packet losses from tcpdump data, and to decide if 
the packet is lost really or timeout occurred. 

The algorithm is the following. An effective loss occurs, 
if a packet is resent. A resent packet begins with the same 
serial number as the original packet, so we have to count 
the number of packets, whose sequence number occurred 
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FIG. 9: Statistics of inter arrival times. The fitted line on 
the slow decaying tail with a = 1.505 indicates long range 
dependent trafhc. 



more than once in the TCP flow. We used timestamps 
to avoid the wrapped sequence number problem. 

Detecting real loss events is a bit more tricky. A sent 
packet is probably lost if the TCP receives duplicate ac- 
knowledgement. Duplicate ACKs are sent by the receiv- 
ing TCP if a packet with higher sequence number has 
arrived. However this can happen due to changes in the 
packet order also, therefore the TCP waits a little and 
retransmits the packet only if packet order switching is 
very improbable. Hence for detecting real loss events we 
have to count the number of resent packets, which are 
sent due to receiving of duplicate ACKs. 

Previously we mentioned that the background traffic 
during the whole measurement cannot be approximated 
by weakly stationary stochastic processes and for anal- 
ysis one has to consider only parts of the data. In this 
parts the flow can be characterized by static parameters 
e.g. the loss ratio is constant in time. These intervals 
cannot be too short to have enough data for statistical 
considerations. In Fig. ^| we plot the loss probability 
versus time for the whole measurement. One can see 
long plateaus however there are non-stationary regimes 
as well. In the following we restrict ourself only for the 
longest stationary range. 

We investigated the statistics of the backoff states for 
this time regime from the data logged by the modified 
linux kernel. We found, that the distribution shows an 
exponential decay as it follows from the Markovian de- 
scription presented in the previous section. The Fig. 
shows the decay of the probability of the backoff states. 
The slope of the fitted line indicates a loss probability 
p = 16.5% felt by the TCP. This loss rate is consis- 
tent with the asymptotic decay of the packet inter-arrival 
times (Fig. O and with the Hurst parameter of the ag- 
gregated traffic (Fig.|Sl). 

So the close relation between the backoff states and the 
self-similarity of the traffic holds. The next question is, 
if the TCP is backed off due to real packet losses or the 
packets where only delayed and timed out. In Fig. 1121 we 
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FIG. 10: Real loss and effective loss during the measurement. 
The loss rate changes in time, therefore one has to restrict 
the analysis for the weakly stationary intervals. 



FIG. 12: Effective loss and real loss rate for a time interval. 
The horizontal line shows the loss probability calculated from 
the backoff states. 
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FIG. 11: Logarithmic plot of the backoff value probabilities 
observed in a transcontinental TCP data flow. The fitted line 
indicates a p = 16.5% loss rate felt by the TCP. 



self-generated burstiness evolves to self-similarity how- 
ever the network properties indicate simple, easily pre- 
dictable traffic. 

In the future we focus on the self-generation of the 
burstiness, what are the exact conditions for emergence 
of self-similarity in perfect network. 
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compare the loss ratio from the backoff statistics (16.5%) 
with the loss probability calculated from the tcpdump 
output. We find, that the average loss probability felt 
by the TCP equals with the real plus virtual (effective) 
loss and not with the real loss alone. Here the difference 
between the two type of losses is crucial, since the real loss 
is smaller than 12.5%, the lower bound of loss probability, 
which gives self-similar traffic, but the effective loss is 
higher. 



CONCLUSION 

We have demonstrated in a model network and in a 

real measurement how TCP can generate self-similar traf- 
fic itself. We showed that at very low packet loss rate 
the congestion control mechanism detects false packet 
losses if the background traffic is bursty. On this fluc- 
tuating background traffic TCP responds with backoff 
states. The switching between backoff and congestion 
avoidance phases introduces further fluctuations into the 
overall traffic, which results additional backoffs. This 
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